ABSTRACT


Herein is presented a Kolmogorov-Smirnov test for grouped
samples. It constitutes an application of W.J. Conover's
procedure, which was originally designed for calculating
exact critical levels for tests with discrete distributions.
This test extends the goodness-of-fit test to samples which
are grouped into intervals. Critical levels in the two-sided
case were calculated to a close approximation.

Some examples of the application of this extension of
the Kolmogorov-Smirnov test are included, and comparisons
among the Chi-square test, the Kolmogorov-Smirnov test for
continuous distributions and the test for grouped samples
are made. Comparisons among the three goodness-of-fit tests
based on the results of computer simulations are also made.
I. INTRODUCTION


One of the most important tests in applications arises in
testing hypotheses about the distribution of a population.
Before specifying a model, we should usually look at the data
to see if they appear to have come from the distribution which
we expect to use for the model. This can be approached through
the histogram, which gives us information about the density
function of the underlying distribution. Another approach is
the sample distribution function, which gives us an estimate
of the underlying cumulative distribution function.

We call a test "a test of goodness-of-fit" if the test
is concerned with the agreement between the distribution
of a set of sample values and some theoretical distribution.

Much work has been devoted to finding test statistics
whose distributions do not depend on parameters in the
distribution of the underlying population. Such tests are
commonly called distribution-free tests.

One of the most well-known and useful goodness-of-fit
tests is the Kolmogorov-Smirnov goodness-of-fit test.
The Kolmogorov-Smirnov test treats individual observations
separately and thus does not lose information through
grouping as does the Chi-square test. Consequently, in
the continuous case, the Chi-square test is frequently less
powerful than the Kolmogorov-Smirnov test.





It is also known that the Kolmogorov-Smirnov test is 
conservative when the hypothesized distribution function 
is not continuous. In many situations, observations from 
a continuous distribution are grouped. However, studies 
of the modifications of the Kolmogorov-Smirnov test for 
use with grouped data appear to be rather limited. 

The Chi-square test, suggested by Pearson (1900), is
well suited for use with grouped data, whereas the K-S
test is for random samples from continuous populations.
W.J. Conover's procedure was designed to calculate critical
levels for random samples from discrete distributions. Since
grouping the data from continuous or discrete populations
will result in a corresponding underlying distribution
which is discrete, Conover's procedure can also be used
for grouped data. The general problem of defining the
classes or determining the class boundaries in some optimal
way has apparently received limited attention. In what
follows we propose a Kolmogorov-Smirnov test for grouped
data, together with a method of finding (almost) exact
critical levels and the power of this test. Thus the
Kolmogorov-Smirnov test may be used as a goodness-of-fit
test regardless of whether the hypothesized distribution
is continuous, or whether the samples are grouped.

The following listing defines the notation used herein:

Notation                        Description

K-S test                        Kolmogorov-Smirnov test for continuous
                                distribution function and ungrouped data.
K-S(g) test                     Kolmogorov-Smirnov test for grouped data.
$S_n(x)$                        Empirical distribution function of a
                                random sample of size n.
$S_n^g(x)$                      Empirical distribution function of a
                                random sample of size n which is grouped.
$n$                             Sample size.
$\alpha$                        Critical level of test.
$\delta$                        Critical value associated with continuous
                                distribution function and ungrouped data.
$\delta^{g}$                    Critical value associated with grouped data.
$F_0(x)$                        Hypothesized distribution function.
$F_X(x)$                        Some population distribution function.
$b_i$                           Some fixed real number representing the
                                class boundary.
$H_0$                           Null hypothesis.
$H_1$                           Alternative hypothesis.
$p$                             Relative frequency of acceptance of null
                                hypothesis.
$x_{(1)}, \ldots, x_{(n)}$      Ordered observed random sample from
                                distribution $F_X$.




II. GENERAL DESCRIPTION OF THE TEST


The Kolmogorov-Smirnov test for goodness-of-fit is
based on the maximum vertical difference between the
empirical and the hypothesized cumulative distribution
functions. Under $H_0$ the empirical distribution function
is expected to be close to the specified distribution
function $F_0(x)$. If the maximum vertical difference
between the specified and the empirical distribution is
not small enough for some x, say greater than a critical
value $\delta$, this may be considered evidence that the
hypothesized distribution is not the one from which the
sample was drawn.

Suppose $F_0(x)$ is the hypothesized distribution and
$S_n(x)$ is the empirical distribution function; then
$S_n(x) = k/n$ for x between the k-th and (k+1)-st order
statistics of the sample, where n is the sample size.

The Kolmogorov-Smirnov test is: accept the hypothesis
$H_0: F_X(x) = F_0(x)$ if and only if

$$D_n = \sup_{-\infty < x < \infty} |F_0(x) - S_n(x)| < \delta,$$

where $\delta$ is adjusted to give a level $\alpha$ test. This is
sometimes called the two-sided Kolmogorov-Smirnov statistic, as
opposed to the one-sided statistics $D_n^+$ and $D_n^-$, where

$$D_n^+ = \sup_{-\infty < x < \infty} \{S_n(x) - F_0(x)\},$$

$$D_n^- = \sup_{-\infty < x < \infty} \{F_0(x) - S_n(x)\}.$$
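The three statistics just defined can be sketched in a few lines of Python. This is only an illustration for an ungrouped sample and a continuous $F_0$; the function names `ks_statistics` and `uniform_cdf` and the toy data are assumptions made for the example and do not come from the text.

```python
def ks_statistics(sample, cdf):
    """Return (D_plus, D_minus, D) for an ungrouped sample.

    cdf is the hypothesized distribution function F_0, assumed continuous.
    """
    xs = sorted(sample)
    n = len(xs)
    # S_n jumps from (i-1)/n to i/n at the i-th order statistic, so both
    # suprema are attained at the sample points.
    d_plus = max(i / n - cdf(x) for i, x in enumerate(xs, start=1))
    d_minus = max(cdf(x) - (i - 1) / n for i, x in enumerate(xs, start=1))
    return d_plus, d_minus, max(d_plus, d_minus)


def uniform_cdf(a, b):
    """Distribution function of U(a, b); a hypothetical helper for the demo."""
    return lambda x: min(1.0, max(0.0, (x - a) / (b - a)))


# Toy data, not taken from the text.
d_plus, d_minus, d = ks_statistics([0.1, 0.4, 0.7], uniform_cdf(0.0, 1.0))
print(d_plus, d_minus, d)
```

The two-sided value is simply the larger of the two one-sided values, which is the relation the test procedure relies on later.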


It is well known that if $X_1, X_2, \ldots, X_n$ is a random
sample from a continuous distribution function $F_0$, then $D_n$
is distribution-free, i.e., it has a distribution independent
of $F_0$.

Let $X_1, X_2, \ldots, X_n$ be independent random variables
with the common cumulative distribution function $F_0(x)$.
Furthermore let $x_{(1)}, x_{(2)}, \ldots, x_{(n)}$ be the result of
n independent observations, arranged in order of size, that
is, the n order statistics. Suppose the above random
samples are divided into k groups, given by class boundaries
$b_i$, $i = 1, 2, \ldots, k$, where the $b_i$'s are the outcomes
(observed values) of certain of the order statistics $x_{(j_i)}$.
Let $S_n^g(x)$ be the sample distribution function based on
the grouped data. Then $S_n^g(x)$ is a step function which
jumps at class boundaries and so can be written:

$$S_n^g(x) = \begin{cases} 0 & \text{for } x < x_{(j_1)} \\ j_i/n & \text{for } x_{(j_i)} \le x < x_{(j_{i+1})} \\ 1 & \text{for } x \ge x_{(n)} \end{cases}$$
For the grouped data, the maximum vertical difference
between $F_0(x)$ and $S_n^g(x)$ occurs at one of the class
boundaries. The two-sided Kolmogorov-Smirnov test statistic
for grouped data (K-S(g) statistic) is defined to be:

$$D_n = \max_{x \in \{x_{(j_1)}, \ldots, x_{(j_k)}\}} |F_0(x) - S_n^g(x)|$$

The one-sided K-S(g) test statistics are:

$$D_n^+ = \max_{x \in \{x_{(j_1)}, \ldots, x_{(j_k)}\}} \{S_n^g(x) - F_0(x)\}$$

$$D_n^- = \max_{x \in \{x_{(j_1)}, \ldots, x_{(j_k)}\}} \{F_0(x) - S_n^g(x)\}$$
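As a rough sketch of the grouped statistics, the following Python fragment evaluates $S_n^g$ and $F_0$ only at the class boundaries. The function name `grouped_ks_statistics`, the boundaries and the four toy observations are assumptions chosen for illustration; they are not data from the text.

```python
from bisect import bisect_right


def grouped_ks_statistics(sample, boundaries, cdf):
    """Return (D_plus, D_minus, D) evaluated at the class boundaries only."""
    xs = sorted(sample)
    n = len(xs)
    diffs = []
    for b in boundaries:
        s = bisect_right(xs, b) / n      # S_n^g(b): fraction of observations <= b
        diffs.append(s - cdf(b))         # S_n^g(b) - F_0(b)
    d_plus = max(diffs)
    d_minus = max(-v for v in diffs)
    d = max(abs(v) for v in diffs)
    return d_plus, d_minus, d


# Hypothesized U(10, 20) and toy observations (assumed, for illustration).
cdf = lambda x: min(1.0, max(0.0, (x - 10.0) / 10.0))
d_plus, d_minus, d = grouped_ks_statistics([13, 13, 15, 19], [12, 16], cdf)
print(d_plus, d_minus, d)
```

Because only the boundaries are inspected, a one-sided value can come out negative when every boundary difference has the same sign; the two-sided value is unaffected.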


In what follows, it will be shown that these three
statistics are distribution free, provided that $F_0(x)$ is
continuous.

Make the change of variables $y = F_0(x)$,
$y_{j_i} = F_0(x_{(j_i)})$ and $Y_i = F_0(X_i)$. Since $F_0(x)$ is
nondecreasing, the inequality $X_i \le x_{(j_i)}$ is equivalent to
$F_0(X_i) \le F_0(x_{(j_i)})$, or $Y_i \le y_{j_i}$. Thus

$$S_n^g(x_{(j_i)}) = \frac{1}{n}(\text{the number of } X_i \le x_{(j_i)}) = \frac{1}{n}(\text{the number of } Y_i \le y_{j_i}) = H(y_{j_i}).$$

Furthermore, the distribution of $Y_i$ is

$$P[Y_i \le y] = P[X_i \le x] = F_0(x) = y.$$

Thus $Y_i$ has a uniform distribution, and
$|F_0(x_{(j_i)}) - S_n^g(x_{(j_i)})| = |y_{j_i} - H(y_{j_i})|$, so that
$D_n$ can be written as

$$D_n = \max_{x \in \{x_{(j_1)}, \ldots, x_{(j_k)}\}} |F_0(x) - S_n^g(x)| = \max_{y \in \{y_{j_1}, \ldots, y_{j_k}\}} |y - H(y)|.$$

The expression on the right is the vertical distance between
the U(0,1) distribution function and the sample distribution
function of the $Y_i$, each of which is U(0,1). Since this
expression does not depend upon $F_0$, $D_n$ is distribution
free. That is, in this case, where the class boundaries are
order statistics, $D_n$ is distribution free. Massey's
procedure (8) or Davidson's method (4) could be used to
calculate critical values for this test.

In the case where the class boundaries are fixed
constants, $D_n$ is no longer distribution free. This can be
described as follows. Suppose the class boundaries are
constants fixed in advance, say $b_i$. The value of $F_0(x)$ at
$b_i$ is $F_0(b_i)$. As before, make the change of "variable"
$y = F_0(b_i)$. However, since $b_i$ is constant, $y = F_0(b_i)$
cannot be transformed to $Y = F_0(B_i)$. In other words

$$D_n = \max_{x \in \{b_i\}} |F_0(x) - S_n^g(x)| = \max_i |F_0(b_i) - S_n^g(b_i)| \ne \max |y - H(y)|.$$

Therefore, it is suggested that Conover's procedure be used
to calculate critical levels for K-S(g) when the class
boundaries are fixed constants.
III. TEST PROCEDURE AND CRITICAL LEVEL


A. TEST PROCEDURE

Unlike the Chi-square test and the Kolmogorov-Smirnov
test, in which critical values are calculated which
correspond to selected critical levels, with Conover's
procedure one calculates critical levels which correspond
to selected critical values. The procedure is as follows.
Let $X_1, X_2, \ldots, X_n$ be a random sample of size n, drawn
from some unknown discrete population distribution $F_X(\cdot)$.
The hypothesis is $H_0: F_X(\cdot) = F_0(\cdot)$, where $F_0(\cdot)$ has all
parameters specified. The alternative is
$H_1: F_X(\cdot) \ne F_0(\cdot)$. The test statistic, either $D_n$,
$D_n^+$ or $D_n^-$, depending on which is desired, is
calculated. Further, the critical level is computed, and if
the value of this critical level is greater than or equal to
that specified in advance (usually either 0.05 or 0.01), the
null hypothesis is accepted; otherwise it is rejected.

Let $\delta$ be some fixed real number, $0 < \delta < 1$. The
critical level of the test, that is $P[D_n \ge \delta]$ for the
two-sided Kolmogorov-Smirnov (discrete) test, can be computed
by using a procedure due to W.J. Conover (2). In fact this is
an approximation method, since $P[D_n \ge \delta]$ is obtained by
calculating $P[D_n^+ \ge \delta]$ and $P[D_n^- \ge \delta]$, where
$\delta = \max(\delta^+, \delta^-)$ and $\delta^+$, $\delta^-$ are the observed
values of $D_n^+$ and $D_n^-$ respectively. $P[D_n \ge \delta]$ is
taken to be approximately $P[D_n^+ \ge \delta] + P[D_n^- \ge \delta]$.
To see the adequacy of this approximation, we proceed as
follows:

$$P[D_n \ge \delta] = P[(D_n^+ \ge \delta) \cup (D_n^- \ge \delta)]$$

becomes

$$P[D_n \ge \delta] = P[D_n^+ \ge \delta] + P[D_n^- \ge \delta] - P[(D_n^+ \ge \delta) \cap (D_n^- \ge \delta)];$$

an approximate value of $P[D_n \ge \delta]$ is thus

$$P[D_n \ge \delta] \approx P[D_n^+ \ge \delta] + P[D_n^- \ge \delta].$$

The error of this approximation is
$P[(D_n^+ \ge \delta) \cap (D_n^- \ge \delta)]$, which is less than the
product $P[D_n^+ \ge \delta] \, P[D_n^- \ge \delta]$. But in practice,
$\delta$ is taken so that $P[D_n^+ \ge \delta]$ and $P[D_n^- \ge \delta]$
are small, and then the product is quite small and can be
safely ignored. For example, suppose both one-sided tests
have the same critical level, say 0.05; then
$P[D_n \ge \delta] = 0.10$ with an error less than
(0.05)(0.05) = 0.0025.
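The arithmetic of this approximation is small enough to sketch directly. The helper name `two_sided_level` is an assumption; the error bound it returns is the product bound stated in the text, not an independently derived one.

```python
def two_sided_level(p_plus, p_minus):
    """Approximate two-sided critical level and the error bound from the text.

    p_plus  = P[D_n^+ >= delta], p_minus = P[D_n^- >= delta].
    """
    approx = p_plus + p_minus          # sum of the two one-sided levels
    error_bound = p_plus * p_minus     # product bound quoted in the text
    return approx, error_bound


approx, bound = two_sided_level(0.05, 0.05)
print(approx, bound)
```

With both one-sided levels equal to 0.05 this reproduces the 0.10 and 0.0025 of the example above.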
B. CRITICAL LEVEL


Let $\delta^- = \max_{-\infty < x < \infty} (F_0(x) - S_n(x))$, where $S_n(x)$
represents the empirical cumulative distribution function of
the sample (Fig. 1).

Figure 1

$\delta^-$ is the maximum vertical difference between the
expected cumulative distribution function and the observed
cumulative distribution function. Draw horizontal lines
$h_j$ with intercept $\delta^- + j/n$, where $0 \le j < n(1 - \delta^-)$.
Then compute the value of $f_j = 1 - (\delta^- + j/n)$. For j = 0
the line $h_0$ has intercept $\delta^-$, and the value of $f_0$ is
$f_0 = 1 - \delta^-$. For j = 1 the intercept of $h_1$ is
$\delta^- + 1/n$, so $f_1 = 1 - (\delta^- + 1/n)$, and so on. If the
horizontal line $h_j$ intersects $F_0(x)$ at a jump point (at a
point x of discontinuity), the value of $f_j$ is 1 minus the
ordinate of $F_0(x)$ at the top of the jump.
As shown by Conover (2), the critical level is

$$P[D_n^- \ge \delta^-] = \sum_{0 \le j < n(1-\delta^-)} \binom{n}{j} f_j^{\,n-j} C_j \qquad (2)$$

where

$$C_k = 1 - \sum_{j=0}^{k-1} \binom{k}{j} f_j^{\,k-j} C_j \qquad (3)$$

and k is defined to be the largest value of the subscript j
such that $f_j > 0$.
Conover's derivation and proof are tedious and are
omitted here. For example, for k = 5 equation (3) becomes:

$$C_0 = 1$$
$$C_1 = 1 - f_0$$
$$C_2 = 1 - f_0^2 - 2 f_1 C_1$$
$$C_3 = 1 - f_0^3 - 3 f_1^2 C_1 - 3 f_2 C_2$$
$$C_4 = 1 - f_0^4 - 4 f_1^3 C_1 - 6 f_2^2 C_2 - 4 f_3 C_3$$


To calculate $P[D_n^+ \ge \delta^+]$, where
$\delta^+ = \max_{-\infty < x < \infty} \{S_n(x) - F_0(x)\}$,
draw horizontal lines $h_j$ with intercepts $1 - \delta^+ - j/n$. The
value of $f_j$ is $1 - \delta^+ - j/n$, but if the horizontal line
$h_j$ intersects $F_0(x)$ at a jump at x, then the value of $f_j$ is
the height of $F_0(x)$ at the bottom of the jump. For this
critical value, equation (2) becomes:

$$P[D_n^+ \ge \delta^+] = \sum_{0 \le j < n(1-\delta^+)} \binom{n}{j} f_j^{\,n-j} C_j \qquad (4)$$

where $C_j$ is given in equation (3).
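Equations (2) to (4) lend themselves to a short program once the values $f_j$ have been read off the graph. The following Python sketch assumes the caller supplies the list of positive $f_j$ values (finding them from the jumps of $F_0$ is not automated here); the function names `conover_coefficients` and `one_sided_level` are hypothetical.

```python
from math import comb


def conover_coefficients(f):
    """Return [C_0, ..., C_m] from equation (3) for f = [f_0, ..., f_m]."""
    c = [1.0]                                            # C_0 = 1
    for k in range(1, len(f)):
        # C_k = 1 - sum_{j<k} binom(k, j) * f_j^(k-j) * C_j
        c.append(1.0 - sum(comb(k, j) * f[j] ** (k - j) * c[j]
                           for j in range(k)))
    return c


def one_sided_level(n, f):
    """One-sided critical level from equation (2) or (4).

    n is the sample size; f holds the f_j values that are greater than zero.
    """
    c = conover_coefficients(f)
    return sum(comb(n, j) * f[j] ** (n - j) * c[j] for j in range(len(f)))


# Values in the style of the n = 16 illustration: f_0 = f_1 = f_2 = .3125.
coeffs = conover_coefficients([0.3125, 0.3125, 0.3125])
level = one_sided_level(16, [0.3125, 0.3125, 0.3125])
print(coeffs, level)
```

For a continuous $F_0$ the same routine can be checked by hand on tiny cases: with n = 1 and $f_0 = 0.7$ it returns 0.7, and with n = 2, $f_0 = 0.5$, $f_1 = 0$ it returns 0.25, which are the exact probabilities for those sample sizes.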







Suppose we have a sample of size n grouped into k
intervals, having $m_i$ observations for the i-th interval,
so that $\sum_{i=1}^{k} m_i = n$. Let $b_i$ be a fixed class boundary of
the i-th interval and $S_n^g(\cdot)$ be the empirical cumulative
distribution function of the grouped sample. $S_n^g(\cdot)$ is a
step function which rises by jumps of $d_i$, where
$0 < d_i < 1$.

For example, for i = 1, the jump occurs at $b_1$ with the
value of $d_1$. The first jump of $S_n^g(\cdot)$ occurs at the first
class boundary. For i = 2, the second class boundary is $b_2$
and at this point the value of the step function is
$(d_1 + d_2)$. So, the step function $S_n^g(\cdot)$ can be written as:

$$S_n^g(x) = \begin{cases} 0 & \text{for } x < b_1 \\ \sum_{j=1}^{i} d_j & \text{for } b_i \le x < b_{i+1} \\ 1 & \text{for } x \ge x_{(n)} \end{cases}$$

where $d_i = \frac{1}{n}(\text{the number of } X_j \text{ in the i-th interval}) = m_i / n$.


In using Conover's procedure for K-S(g), let $\delta^{g}$ be the
maximum vertical distance between $F_0(\cdot)$ and $S_n^g(\cdot)$, that is

$$\delta^{g} = \max_{x \in \{b_1, \ldots, b_k\}} (F_0(x) - S_n^g(x)),$$

or it can be written (Fig. 2) as:

$$\delta^{g} = \max_i (F_0(b_i) - S_n^g(b_i)).$$

Figure 2

As before, draw the horizontal lines $h_j$ and then compute the
values of $f_j$. The values of $C_j$ are calculated by using
equation (3) and finally the critical level is computed by
equation (4).

As the sample size is increased, the calculation (if
done by hand) will be more tedious, so this procedure should
be used for small sample sizes, say less than 30. Since
the critical level is a decreasing function of $\delta^+$ and
$\delta^-$, the bigger the value of this maximum vertical
difference between $F_0(\cdot)$ and $S_n^g(\cdot)$ the smaller will be the
critical level, and vice versa. For illustration the
following is an example of how to use Conover's procedure.
Suppose a random sample of size 16 is drawn from some
population $F_X(\cdot)$.

The observed values are (ordered):
10.39    12.04          14.6
10.40    12.93          14.26
10.42    12.63          14.40
11.46    [illegible]    14.47
11.49    [illegible]    14.84
         13.94


The hypothesis is $H_0: X \sim U(10, 20)$ with alternative
$H_1: X \not\sim U(10, 20)$. The samples are grouped into three
intervals. The values of the class boundaries are chosen
to be $b_1 = 12$, $b_2 = 14$, so the three intervals are
$(-\infty, 12)$, $(12, 14)$ and $(14, \infty)$ (Fig. 3).
Figure 3


The value of the step function $S_n^g(\cdot)$ at the first class
boundary is $S_n^g(x) = \frac{1}{16}(\text{the number of } x_i \le 12) = .3125$;
at the second class boundary it is
$S_n^g(x) = \frac{1}{16}(\text{the number of } x_i \le 14) = .6875$.
Under the null hypothesis, the theoretical rates for these
three intervals are (12 - 10)/10 = .20, (14 - 10)/10 = .40 and
(14.84 - 10)/10 = .484 respectively.

Here the maximum vertical difference is 1 - .484 = .5160.
The line $h_0$ with intercept .5160 intersects $F_0(\cdot)$ at a
jump point, so $f_0 = 1 - .6875 = .3125$; $h_1$ with intercept
.5160 + 1/16 = .5785 also intersects $F_0(\cdot)$ at a jump point,
so $f_1 = .3125$; and finally the intercept of $h_2$ is .6410,
which again falls at a jump point, so $f_2 = .3125$. The value
of k in formula (3) is equal to 2, since $h_3$ intersects
$F_0(\cdot)$ at the jump point with top ordinate 1.00, so $f_3 = 0$,
which is not considered. By using formula (3), $C_j$ is
computed and found to be $C_0 = 1$, $C_1 = .6875$, $C_2 = .4727$.
Thus,

$$P[D_n^- \ge .5160] = (.3125)^{16} + 16 (.3125)^{15} (.6875) + 120 (.3125)^{14} (.4727) = .000005.$$

Since $F_0(x)$ is symmetric, $P[D_n^+ \ge .5160]$ is equal to
the above, so that $P[D_n \ge .5160] = 2 \times .000005 = .00001$.
Using $\alpha = .01$ the null hypothesis should be rejected since
0.00001 < 0.01. Incidentally this is a correct decision
in this case, since the above random sample was actually
drawn from a population having distribution U(10, 15).
IV. COMPARISON AND THE K-S(g) TEST POWER

The basic difference among the three tests, the
Chi-square, the K-S and the K-S(g), is that the Chi-square test
is sensitive to vertical deviation between the observed
and expected histogram, the K-S test is based on vertical
deviation between the observed and expected cumulative
distribution function, whereas K-S(g) is based on vertical
deviation between the observed and expected cumulative
distribution function associated with discrete groups.
Another obvious difference is that the K-S statistic is
distribution free, whereas for K-S(g) the statistic is not
distribution free.

Both Chi-square and K-S(g) require that the data be
grouped; in contrast, the K-S test does not. Therefore
when the underlying distribution is continuous the K-S test
permits us to investigate the goodness-of-fit with
information from each observation. By contrast, both
Chi-square and K-S(g) lose some information since individual
observations are grouped into a relatively small number of
classes. Further, the Chi-square and K-S(g) tests are affected
by the number and the length of the class intervals, which
are chosen arbitrarily by the experimenter. The following is
another example of applying the K-S(g), which is followed
by a comparison with the Chi-square test. Suppose we have a
random sample of size n = 15, drawn from some population
$F_X(\cdot)$. The observations are (ordered):

2.01     3.91     14.89
2.08     5.32     15.27
2.24     9.09     21.34
2.52              29.73
2.70              36.54
                  39.43
                  40.74

The hypothesis to be tested is that the sample has come from
an exponential distribution with $\lambda = 6$, at $\alpha = 0.05$.

$H_0: X \sim EXP(6)$

$H_1: X \not\sim EXP(6)$

Suppose the sample had been divided into three groups
associated with the intervals (0, 2.70), (2.70, 9.09) and
$(9.09, \infty)$. Under the null hypothesis the expected grouped
c.d.f. has ordinates .3624, .7791 and 1 respectively. The
observed and expected frequencies for each group are
5, 3, 7 and 5.436, 6.2505, 3.3135 respectively.

The maximum vertical difference $\delta^{g}$ occurs at x = 9.09,
where $\delta^{g} = .7791 - .5333 = .2458$. The values of $f_j$ and
$C_j$ are computed as in the previous example and are found
to be:
$f_0 = .6376$    $f_2 = .2209$    $C_0 = 1$        $C_7$ = [illegible]
$f_1 = .6376$    $f_3 = .2209$    $C_1 = .3624$    $C_8$ = [illegible]
                 $f_4 = .2209$    $C_2$ = [illegible]
                 $f_5 = .2209$    $C_3$ = [illegible]
                 $f_6 = .2209$    $C_4 = .2335$
                 $f_7 = .2209$    $C_5 = .2097$
                 $f_8 = .2209$    $C_6 = .1914$


With those values of $f_j$ and $C_j$, $P[D_n^- \ge .2458]$ is found
to be .0395. Using a similar procedure it can be seen that
$P[D_n^+ \ge .2458] = 0.0162$, so $P[D_n \ge .2458]$ is
approximately 0.0395 + 0.0162 = 0.0557. The hypothesis is
accepted at $\alpha = 0.05$.

The value of the Chi-square statistic on the
same sample, with three groups and hence 2 degrees of
freedom, is 5.3936. This is less than 5.99, the $\chi^2$
critical value for $\alpha = 0.05$. Again the hypothesis would be
accepted; that is, both tests accept that the random sample
has been obtained from a population having an exponential
distribution with $\lambda = 6$. Using interpolation in tables of
the incomplete gamma function, the critical value of the
Chi-square statistic associated with $\alpha = 0.0557$ is found
to be approximately 5.7724. For $\alpha = 0.06$ this approximate
value is 5.6275. If we had used $\alpha = 0.06$ rather than 0.05,
the K-S(g) test would have rejected the null hypothesis
since 0.0557 < 0.06. However, the Chi-square test would
have accepted the null hypothesis since 5.3936 is less
than 5.6275. The Chi-square test will reject the null
hypothesis at any $\alpha$ level which is greater than 0.06.
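For the special case of 2 degrees of freedom used in this example, the Chi-square upper tail has the closed form $P[\chi^2 \ge x] = e^{-x/2}$, so the critical values quoted above can be checked without tables. The sketch below relies only on that standard fact; the function names are assumptions, and the routine does not apply to other degrees of freedom.

```python
from math import exp, log


def chi2_2df_critical_value(alpha):
    """Critical value c with P[chi-square(2 df) >= c] = alpha."""
    return -2.0 * log(alpha)


def chi2_2df_level(statistic):
    """Critical level P[chi-square(2 df) >= statistic]."""
    return exp(-statistic / 2.0)


for alpha in (0.05, 0.0557, 0.06):
    print(alpha, round(chi2_2df_critical_value(alpha), 4))
print(round(chi2_2df_level(5.3936), 4))
```

The closed form gives about 5.99, 5.78 and 5.63 for the three levels, in line with the interpolated values in the text to roughly two decimal places, and it places the critical level of the observed statistic 5.3936 at about 0.067.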

In fact, the random sample for the present example
was generated from a population having an exponential
distribution with mean 15. Although the critical level
$\alpha = 0.06$ is not commonly used, the Chi-square test cannot
detect that the random sample did not come from an
exponential distribution with mean 6.

From the example in Chapter III, it was found that the
null hypothesis was rejected at $\alpha = 0.01$. With the same data,
with three groups, hence 2 degrees of freedom, under the
null hypothesis the value of the Chi-square test statistic
is 13.4075. This is greater than 10.6, the $\chi^2$ value at
$\alpha = 0.005$. Thus, both tests reject the null hypothesis at
$\alpha = 0.05$. The critical value of the Chi-square test
associated with $\alpha = 0.00001$ is found to be approximately
23.4, which means that if we had used $\alpha = 0.00001$, the
Chi-square test would have accepted the null hypothesis, since
13.4075 < 23.4. Of course, the critical level $\alpha = 0.00001$
is rarely used in practice. The acceptance of the null
hypothesis by the Chi-square test at $\alpha = 0.00001$ is most
likely caused by the very low value of the expected
frequency in the fourth group of the sample, that is 1.344.

Suppose the null hypothesis is $H_0: X \sim U(10, 18)$ with
critical level $\alpha = 0.05$. Under this null hypothesis, the
value of the Chi-square statistic is 5.320, which is less
than 10.6. In this case, the Chi-square test accepts the
null hypothesis, or in other words, the Chi-square test
accepts that the random sample was generated from a
population having a U(10, 18) distribution. Again in this
case the Chi-square test has a low expected frequency in
the fourth group, namely 1.92.


For the K-S(g) test, under the null hypothesis
U(10, 18), the value of $\delta^{g}$ is .395, so $P[D_n \ge .395] =
0.00008$, which is less than 0.05. The K-S(g) test rejects
the null hypothesis, which means it rejects that the random
sample was generated from a population having a U(10, 18)
distribution.


The present examples suggest the K-S(g) may be more
powerful than the Chi-square test, at least in certain
cases. Simulations conducted to explore this question are
discussed below.

In the previous example, concerning the exponential
distribution, the critical value of the K-S statistic in
the continuous case at $\alpha = 0.0557$ is between .304 and .338,
which leads us to reject the null hypothesis that the
sample was generated from an EXP($\lambda = 6$) distribution. At
the same critical level, for example 0.0557 or 0.00008,
grouping samples into intervals tends to lower the power.
Thus, for grouped data (samples), the appropriate critical
values are smaller than those tabulated for the continuous
case. Thus, use of tables of critical values of the K-S
test does not give a correct test. Therefore the need
for the K-S(g) procedure is evident.
V. SIMULATION


In order to get a better comparison between the three
goodness-of-fit tests previously mentioned, especially
their powers, simulation was used. The powers of the tests
should be compared under the same conditions, namely at
the same significance level and for the same null hypothesis.
There are two procedures for this simulation. The first
procedure is as follows: generate a random sample of size
n from a population having distribution $F_X(\cdot)$. Specify
a critical level $\alpha$, for example $\alpha_1$, and find the critical
value $\delta_1$ of the K-S statistic associated with $\alpha_1$ and n.
By using tables of the incomplete gamma function, one
can calculate (at least approximately) the critical value
of the Chi-square statistic associated with $\alpha_1$. Let $C_1$
denote this critical value. These three values, $\alpha_1$,
$\delta_1$, $C_1$, are then used as the input to the simulation for
computing the relative frequency of acceptance of $H_0$ by the
K-S(g), the K-S for continuous distribution and the
Chi-square tests, respectively. Further, generate the random
samples N times. The hypothesis to be tested is
$H_0: F_X(\cdot) = F_0(\cdot)$ against $H_1: F_X(\cdot) \ne F_0(\cdot)$ at level
$\alpha_1$.

Let $p_1$, $p_2$, $p_3$ denote the relative frequency of
acceptance of $H_0$ by the K-S, the K-S(g) and the Chi-square
tests respectively; then
$$p_1 = \frac{\text{"number of times" } (D_n < \delta_1)}{N}$$

$$p_2 = \frac{\text{"number of times" } (P[D_n \ge \delta^{g}] \ge \alpha_1)}{N}$$

$$p_3 = \frac{\text{"number of times" } (\chi^2 < C_1)}{N}$$

The acceptance decisions of both the K-S and the Chi-square
tests can be programmed. However, it appears to be difficult
to program Conover's procedure. To the knowledge of the
author, a subroutine for this procedure is not available.

Therefore the simulation has been done indirectly,
using a second procedure. This can be described as follows.
With the same random samples from the same $F_X(\cdot)$ and the
same number of groups, for $0 < d_1 < d_2 < 1$,
$P[D_n \ge d_1] \ge P[D_n \ge d_2]$. If the hypothesis is accepted
at $P[D_n \ge d_1] = \alpha_1$, then it will be accepted at
$P[D_n \ge d_2] = \alpha_2$, since $\alpha_1 \ge \alpha_2$. In other words the
sample rate of acceptance of K-S(g) is defined to be

$$\frac{\text{"number of times" } (D_n < d)}{N}$$

where d is specified in advance. Once d is fixed,
$P[D_n \ge d] = \alpha_1$ can be computed, from which the critical
values for the other two tests associated with $\alpha_1$ can be
calculated. However, it will be difficult to obtain an
approximate value for the critical values of the K-S test
because of the limited $\alpha$ values in the tables. Therefore,
d is used to calculate the frequency of acceptance by both
K-S tests. So, the inputs for this procedure are $C_1$ and d.
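The second procedure amounts to counting how often the grouped statistic stays below a fixed d. A minimal Monte Carlo sketch in Python is given below; it is not the program used for the study. The function names, the seed, the number of repetitions and the use of the interior boundaries 11, 12, 13, 14 for the U(10, 15) null hypothesis are all assumptions made for illustration.

```python
import random
from bisect import bisect_right


def grouped_d(sample, boundaries, cdf):
    """Two-sided grouped statistic: max |F_0(b) - S_n^g(b)| over boundaries."""
    xs = sorted(sample)
    n = len(xs)
    return max(abs(cdf(b) - bisect_right(xs, b) / n) for b in boundaries)


def acceptance_rate(draw, n, boundaries, cdf, d, n_rep, seed=0):
    """Relative frequency of (D_n < d) over n_rep simulated samples of size n."""
    rng = random.Random(seed)
    accepted = 0
    for _ in range(n_rep):
        sample = [draw(rng) for _ in range(n)]
        if grouped_d(sample, boundaries, cdf) < d:
            accepted += 1
    return accepted / n_rep


# Null hypothesis U(10, 15), samples of size 16, d = .3 (settings as in the text).
null_cdf = lambda x: min(1.0, max(0.0, (x - 10.0) / 5.0))
boundaries = [11.0, 12.0, 13.0, 14.0]
rate_null = acceptance_rate(lambda r: r.uniform(10.0, 15.0), 16,
                            boundaries, null_cdf, 0.3, 2000)
rate_alt = acceptance_rate(lambda r: r.uniform(10.0, 13.0), 16,
                           boundaries, null_cdf, 0.3, 2000)
print(rate_null, rate_alt)
```

Under the U(10, 13) alternative every observation lies at or below 13, so the difference at that boundary is always 1 - .6 = .4 and the hypothesis is never accepted; one minus the acceptance rate under an alternative is the estimated power.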

For the purpose of the simulation, groups of random
numbers were generated from Exponential distributions with
means $\lambda$ = 6, 9, 12 and 15; random samples of size 14 were
generated from Normal distributions with $\mu = 0$, $\sigma$ = 1, 3 and
5; random samples of size 16 were generated from Uniform
distributions U(10, 13), U(10, 15) and U(10, 17). The
value of d was chosen arbitrarily to be d = .3. The results
of this simulation are summarized in table (5.1).

The null hypothesis for the four Exponential cases was
$F_0$ = EXP($\lambda = 9$), with 4 groups given by the intervals (0, 3),
(3, 6), (6, 9) and (9, 100). By Conover's procedure
$P[D_n \ge d = .3] = \alpha_1 = 0.0328$. The null hypothesis for the
three Normal cases was N(0, 1). Four groups were used
with intervals $(-\infty, -1)$, $(-1, 0)$, $(0, 1)$ and $(1, \infty)$, and
$P[D_n \ge d = .3] = \alpha_1$ was found to be 0.0185. Finally, the
null hypothesis for the Uniform cases was U(10, 15), with
5 groups given by the intervals (10, 11), (11, 12), (12, 13),
(13, 14), (14, 17), and $P[D_n \ge d = .3] = \alpha_1$ was found to
be 0.022. The associated critical values for the Chi-square
test are approximately 8.7517, 8.00, 13.245 in the
EXPONENTIAL, NORMAL and UNIFORM cases, respectively. These
three critical values together with the three $\alpha$ levels are
then used in the input to the simulation.

Simulation results give $p_2 = .9797$ for the acceptance
frequency of the K-S(g) test for the null $F_0$ = U(10, 15)
distribution, which means (1.00 - .9797) = 0.0203 is the
estimated rejection rate, in other words $\alpha_1 = 0.0203$. The
input $\alpha_1$ for the Uniform random cases is 0.022; the
difference 0.0017 between these is not significant. From table
(5.1) it can be seen that the corresponding difference for
the Exponential case is 0.0028 and for the Normal case is
0.0008, which are not significant. From the same table it
can be seen that the significance levels (for the null
hypothesis) for both the Chi-square and K-S(g) tests in
the Uniform and Exponential cases are very close.

For example, in the Exponential case, this difference
is only (.0356 - .0316) = .004. Therefore these tests can
be assumed to be at the same $\alpha$ level. In the Normal case
with grouped data, the value of $\alpha_1$ is 0.0177. This gives
a difference of 0.0008 compared with the input $\alpha_1 = 0.0185$.
However, the critical value of the Chi-square statistic
associated with $\alpha_1 = 0.0185$ and 3 degrees of freedom is 8.0,
which gives .9565 for the acceptance rate under the null
hypothesis. In other words the $\alpha$ value here is 0.0435,
which is not sufficiently close to 0.0185 since the
difference is 0.0250. To get a closer value, the critical value
of 8.0 was adjusted to 10.50, which gave $\alpha_1 = 0.0126$. This
adjustment causes the other two acceptance rates, for N(0, 9)
and N(0, 25), to be larger.

With this adjustment all the tests, Chi-square, K-S
and K-S(g), are at (nearly) the same $\alpha$ level, so comparisons
of their powers are easily performed.
For the U(10, 17) case the powers of the Chi-square
and the K-S(g) tests are [illegible]205 and 0.2426, and for
U(10, 13) both are equal to 1.00. It means that in the
U(10, 17) case the K-S(g) test appears to be more powerful
than the Chi-square test, whereas in the U(10, 13) case both
tests have the same power. For the Exponential case with
parameter 6, 12 and 15 the K-S(g) test turns out to
be more powerful than the Chi-square test in these
Exponential and Normal cases. Looking at the results of
another simulation, using three samples generated from
distributions including U(0, 20) and U(0, 22), where the
null hypothesis to be tested was the Exponential distribution
with mean $\lambda = 9$, which are also tabulated in table (5.1),
the K-S(g) appears to be more powerful than the Chi-square
test.

For the Normal case, the Chi-square test turns out to
be more powerful than the K-S(g) even though the expected
frequencies in the first and fourth groups were very low,
only 2.220 and 2.218. Of course, the Chi-square distribution
in these applications is only an approximation. This
approximation gives better results for larger samples. It
is a rule of thumb that the approximation can be used with
confidence as long as every expected frequency is at least
equal to 5. This rule should not be considered inflexible,
however. It appears to be a conservative value, and the
Chi-square approximation was reasonably accurate in this
study, even for expected cell frequencies as small as 1.5
(6).
Based on the results of the simulation for Exponential
cases where two of the four groups have low expected
frequencies (namely 3.0315 and 2.2350), the approximation
appears reasonably accurate. The estimated value of $\alpha_1$
obtained from this simulation is $\alpha_1 = 0.0316$, which is close
to the "target," $\alpha_1 = 0.0328$. By contrast, for the Normal
case, the approximation is not so accurate. Thus, it was
adjusted to get a closer value, as mentioned before. This
inaccurate approximation may be caused by the small
expected frequencies in the tail.

Table (5.2) shows results of yet another simulation.
By keeping the variance constant and letting the means vary,
again the K-S(g) test appears to be more powerful than
the Chi-square test.

In all cases, without grouping the data, in other words,
in the continuous case, the ordinary K-S test is the most
powerful test. Existing tables for the K-S test
indicate the critical value $\delta = .3$ corresponds to an $\alpha$ level
somewhere between .10 and .15. To get the same, or at
least close to, the critical level used for the K-S(g), for
example 0.0328 for the Exponential case, d = .3 should be
increased. In other words, the value of $\delta$ for the continuous
case would be greater. Again, grouping the data and
holding the same $\alpha$ level requires the critical value of the
K-S test to be smaller than that tabulated.
VI. SUMMARY


Based on the results of the simulations, grouping the
data causes the K-S statistic to be stochastically
smaller than that for the ordinary tabulated K-S
statistic. For continuous underlying distribution
functions the K-S test is the most powerful of the
three tests considered.

It is hard to draw a general conclusion related to
the relative powers of the K-S(g) and Chi-square
tests. In some cases the K-S(g) test is more powerful
than the Chi-square test; in other cases the reverse
is true. It is suggested that the relative powers be
investigated in more detail. Further investigation is
also needed to determine rules of thumb for the
appropriate number of groups to be used.

For small samples, the ordinary table of the K-S
statistic cannot be used for grouped data. The extension
of the K-S test suggested herein should be used when
the data have been grouped. Unfortunately, for sample
sizes larger than 30 the calculation becomes tedious.

It would be very worthwhile to program a subroutine of
Conover's procedure. The availability of the subroutine
would enable this test to be used without time-consuming
calculations, and also possibly make the test available
for larger sample sizes.
It is suggested that the possibility of developing a
"quick and dirty" modification of the K-S table for use
in discrete and grouped cases be investigated.

This simulation also demonstrated the adequacy of the $\chi^2$
approximation, even when expected cell frequencies
fell substantially below 5.

Care should be taken in computing $D_n$, the maximum
vertical distance between $F_0(\cdot)$ and $S_n^g(\cdot)$. This
maximum distance is equal to the greatest value of
$F_0(b_i) - S_n^g(b_i)$, where $b_i$ ranges over the set of
class boundaries.